Generating Lexicalization Patterns for Linked Open Data
نویسنده
چکیده
The concept of Linked Data has attracted increased interest in recent times due to its free and open availability and the sheer of volume. We present a framework to generate patterns which can be used to lexicalize Linked Data. We use DBpedia as the Linked Data resource which is one of the most comprehensive and fastest growing Linked Data resource available for free. The framework incorporates a text preparation module which collects and prepares the text after which Open Information Extraction is employed to extract relations which are then aligned with triples to identify patterns. The framework also uses lexical semantic resources to mine patterns utilizing VerbNet and WordNet. The framework achieved 70.36% accuracy and a Mean reciprocal Rank value of 0.72 for five DBpedia ontology classes generating 101 lexicalizations.
منابع مشابه
A Multi-strategy Approach for Lexicalizing Linked Open Data
This paper aims at exploiting Linked Data for generating natural text, often referred to as lexicalization. We propose a framework that can generate patterns which can be used to lexicalize Linked Data triples. Linked Data is structured knowledge organized in the form of triples consisting of a subject, a predicate and an object. We use DBpedia as the Linked Data source which is not only free b...
متن کاملRealText-lex: A Lexicalization Framework for Linked Open Data
Linked Open Data (LOD) is growing rapidly as a source of structured knowledge used in a variety of text processing applications. However, the applications using the LOD need to be able to mediate between the front end user interfaces and LOD. This often requires a natural language interpretation of this structured, linked data. We demonstrate a middle-tier framework that can generate patterns w...
متن کاملLexicalizing DBpedia with Realization Enabled Ensemble Architecture: RealText-lex2 Approach
DBpedia encodes massive amounts of open domain knowledge and is growing by accumulating more triples at the same rate as Wikipedia. However, the applications often require natural language formulations of these triples to present the information as a natural text. The RealTextlex2 framework offers a scalable platform to transform these triples to natural language sentences using lexicalization ...
متن کاملMultilingual Question Answering over Linked Data (QALD-3): Lab Overview
The third instalment of the open challenge on Question Answering over Linked Data (QALD-3) has been conducted as a half-day lab at CLEF2013. Di↵erently from previous editions of the challenge, QALD-3 put a strong emphasis on multilinguality, o↵ering two tasks: one on multilingual question answering and one on ontology lexicalization. While no submissions were received for the latter, the former...
متن کاملTalmy’s Dichotomous Typology and Japanese Lexicalization Patterns of Motion Events
Talmy‘s (1985) crosslinguistic typology of lexicalization patterns of motion events have been extensively used in second language acquisition (SLA) research as a means to examine how second language (L2) learners map form, meaning, and function. These studies have yielded some conflicting results regarding the learnability of L2 lexicalization patterns arguably the oversimplification over and...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015